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MOLECULAR MARKERS FOR DIAGNOSING 



HEPATOCELLULAR CARCINOMA 



Reference to Government Grant 



This invention was made in the course of research sponsored 



5 by the National Institutes of Health grants CA48656 and CA66971. The 
U.S. Government has certain rights in the invention. 

Background of the Invention 

Primary hepatocellular carcinoma (HCC) is one of the most 
common tumors seen in certain areas of the world. In Asia and sub- 

10 Saharan Africa it has an annual incidence rate of 500 cases per 100,000 
population. In the United States and Europe, HCC accounts for 1 to 2 
percent of tumors seen at autopsy (Podolsky, D.K. and K.J. Isselbacher. 
1 994. Harrison 's Principles of Internal Medicine, pp. 1 496-1 497) . There are 
risk factors for HCC, however, that can lead to a large increase in the 

1 5 likelihood that tumors will develop. For example, HCC is usually associated 
with a cirrhotic liver, making alcoholics more likely to develop these tumors. 

The increased incidence of HCC in Asian and African 
populations and elsewhere has been attributed to the high incidence of 
chronic infection with hepatitis B virus (HBV) and hepatitis C virus (HCV). 

20 These chronic infections can lead to hepatitis and cirrhosis which are the 
most common risk factors for HCC. The link between HBV infection and 
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HCC is well established. Studies in Asia have shown that the incidence of 
this form of cancer over time is increased 100-fold in individuals with 
evidence of HBV infection as compared to non-infected controls (Podolsky, 
D.K. and KJ. Isselbacher. 1994. Harrison's Principles of Internal Medicine, 
5 pp. 1496-1497). More recent work in Europe and Japan has shown that 
HCV is also linked to an increased risk of HCC. In fact, any agent or factor 
that contributes to chronic, low-grade liver cell damage would make liver 
cell DNA more susceptible to damage and genetic alterations which can 
lead to carcinogenesis. The mechanisms and steps responsible for the 

10 development of HCC, however, have not been fully elucidated. 

The finding that HBV makes a genetic contribution to the 
development of HCC (Seegeretal. 1991 . J. Virol. 65:1673-1679) suggests 
that one or more virus encoded proteins may play a role in 
hepatocarcinogenesis. Other data suggests that hepatitis B x antigen 

15 (HBxAg) contributes to the pathogenesis of HCC. HBxAg transforms a 
mouse hepatocyte cell line both in vitro and in vivo (Hohne, M. et al. 1990. 
EMBO J. 9:1137-1145; Seifer, M. et al. 1991. J. Hepatol. 13:S61-S65). 
HBxAg binds to and functionally inactivates the tumor suppressor p53 
(Feitelson, M.A. et al. 1993. Oncogene 8:1109-1117; Wang, X.W. et al. 

2 0 1994. Proc. Natl. Acad. Sci. USA 91:2230-2234; Truant, R. et al. 1995. J. 
Virol. 69:1851-1859; Takeda, S. et al. 1995. J. Cancer Res. Clin. Oncol. 
121:593-601). HBxAg/p53 staining and complex formation has also been 
shown to correlate with the development of liver tumors in a X transgenic 
mouse model with sustained high levels of HBxAg expression (Kim, CM. 

25 etal. 1991. Nature 351:31 7-320; Koike, K.etal. 1994. Hepatology^ 9:810- 
819; Ueda, H. et al. 1995. Nature Genetics 9:41-47). 

It has previously been shown that HBxAg is a trans-activating 
protein (Twu, J.S. and R.H. Schloemer. 1987. J. Virol. 61:3448-3453; 
Rossner, M.T. 1992. J. Med. Virol. 36:101-117; Henkler, F. and R. Koshy. 

30 1996. J. Viral Hepatitis 3:109-121). Even though virus DNA fragments 
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integrated into HCC cells often contain the X region (Matsubara, K. and T. 
Tokino. 1990. MoL Biol. Med. 7:243-260; Unsal, H. etal. 1994. Proc. Natl. 
Acad Sci. USA 91:822-826) and HBxAg made from these integrated 
sequences has transactivating-activity, it is not clear that this action is 
responsible for transformation (Luber, B.L.etal. 1996. Oncogene 12: 1597- 
1608). A variety of studies have described differences in gene expression 
which distinguish tumor (HCC) form nontumor (liver) cells (Begum, N. A.. 
M. Mori, T. Matsumata, K Takenaka, K. Sugimachi, and G. F. Barnard. 
1995. Hepatology 22:1447-1455; Darabi, A., S. Gross, M. Watabe, M. 
Malafa, and K. Watabe. 1995. Cancer Lett 95:153-159; Inui, Y., S. 
Higashlyama, S. Kawata, S. Tamura, J. -I. Miyagawa, N. Taniguchi, and Y. 
Matsuzawa. 1994. Gastroenterology 107:1799-1804; Kim, S. O., J. G. 
Park, and Y. L Lee. 1996 Cancer Res. 56:3831-3836; Ohmachi, Y., A. 
Murata, T. Yasuda, K Kitagawa. S. Yamamoto, M. Monden, T. Mori, N. 
Matsuura, and K. Matsubara. 1994. J. Hepatol. 21:1012-1016; Su, W., J. 
F. Liu, S. X. Zhang, D. F. Li, and J. J. Yang. 1994. Hepatology 19:788-799; 
Uekt, T., J. Fujimoto, T. Suxukt, H. Yamamoto, and E. Okamoto. 1997. 
Hepatology 25:862-866; Yamashita, N. f H. Ishibashi, K. Hayashida, J. 
Kudo, K. Takenaka, K. Itch, and Y. Niho. 1996. Hepatology 24: 1437-1 440; 
Zhou, M. X., M. Watabe. and K. Watabe. 1994. Arch. Virol. 134:369-378). 
However, no indication has been given whether any of these genes are 
turned on or off by HBxAg. 

One of the problems associated with any type of cancer is 
ensuring early detection and risk factor screening so that disease can be 
more successfully treated. Detection of HCC may escape clinical 
recognition because of the presence of other active disease processes, 
such as hepatitis or cirrhosis One screening tool has been alpha 
fetoprotein levels, where levels greater than 500 pg/L are found in 70-80% 
of patients with HCC (Podolsky, D.K. and KJ. Isselbacher. 1994. Harrison's 
Principles of Internal Medicine, pp. 1496-1497). The most common 
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diagnostic tools are imaging with ultrasound, which can only detect the 
presence of visible tumors, and liver biopsy. Neither of these diagnostic 
tools is able to screen individuals for the risk of disease before tumors 
develop. In biopsy, it can be difficult to distinguish large cirrhotic nodules 
5 from well-differentiated HCC or low-grade dysplastic nodules from HCC. 

Clearly, there is a need for better methods of early diagnosis, 
as well as risk screening. Criteria for judging the usefulness of HCC 
screening methods were recently reviewed by Collier and Sherman, 1988. 
Hepatology 27:273-278. 

10 Summary of the Invention 

The invention is a method for detecting hepatocellular 
carcinoma in liver tissue of a patient. A liver tissue sample is obtained from 
the patient, and the level of expression of one or more marker genes in the 
sample is assessed. The marker genes are differentially expressed in 

15 HBxAg[+] cells as compared with HBxAg[-] cells. A reduction in the level 
of expression of one or more marker genes in the sample as compared to 
the expression level in noncancerous liver tissue is indicative of 
hepatocellular carcinoma in the sample. 

According to an embodiment of the invention, the marker 

2 o gene is selected from the group of genes expressing RNA transcripts which 
hybridize under conditions of high stringency to a nucleic acid probe 
selected from the group consisting of SEQ ID NO:1 , SEQ ID NO:2, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO;5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID 
NO:8, SEQ ID NO:9, and SEQ ID NO:10. More particularly, the marker 

25 gene is selected from the group consisting of 

a gene which encodes the polypeptide of SEQ ID 

NO:27; 

a gene which encodes the polypeptide of SEQ ID 
NO:28; 
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a gene which encodes the polypeptide of SEQ ID 

NO:29; 

a gene which encodes the polypeptide of SEQ ID 
NO:29; 



human tubulin-folding cofactor E gene; 

human myeloblast KIAA0132 gene; and 

the human fetal heart gene, the cDNA of which is 

identified as GenBank accession number AA047006. 

An example of high stringency hybridization conditions is 



hybridization at4XSSC at65°C, followed by washing in 0.1XSSC at65 c C 
for one hour. Another example of high stringency hybridization conditions 
is hybridization in 50% formamide, 4XSSC at 55°C. 



According to one embodiment of the invention, the step of 



assessing the level of expression of the marker gene in the sample 
comprises contacting the sample with one or more probes which detect 
mRNA which is differentially expressed in HBxAg[+] cells as compared with 
HBxAg[-] cells. 



According to another embodiment of the invention, the step 



of assessing the level of expression of the marker gene by in the sample 
comprises assessing the level of expression of marker protein encoded by 
one or more marker genes. Detection of marker protein is accomplished 
by contacting the sample with one or more antibodies which bind marker 
proteins. 



According to another embodiment, the invention provides a 



method for diagnosing hepatocellular carcinoma comprising the steps of 
obtaining a liver tissue sample from a patient, and assessing the level of 
expression of one or more marker genes in the tissue sample, which 
marker genes are differentially expressed in HBxAg[+] cells as compared 



5 



hu-s/7/'; 
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with HBxAg[-] cells. The reduction of detectable expression of one or more 
marker genes in the sample is indicative of hepatocellular carcinoma. 

In yet another embodiment, the invention provides a method 
for identifying patterns in gene expression in a biological sample that are 
altered by hepatitis B x antigen comprising the steps of 

obtaining a biological sample; 

contacting said sample with a probe which detects an 
mRNA which is differentially expressed in HBxAg[+] cells as compared with 
HBxAgH cells; and 

detecting expression of a gene encoding said mRNA 
detected by the probe. 

Alternatively, the steps for identifying alterations in gene 
expression patterns in the biological sample comprise 

contacting said sample with an antibody which detects 
a protein which is differentially expressed in HBxAg[+] cells as compared 
with HBxAg[-] cells; and 

detecting expression of a gene encoding the protein 
detected by the antibody. 

Abbreviations and Definitions 



Abbreviations 


"ABC" 


avidin-biotin-peroxidase complex 


"bp" 


base pair 


"CAT" 


chloramphenicol acety transferase 


"ISH" 


in situ hybridization 




deoxythymidine triphosphate 


"HBxAg" 


hepatitis B x antigen 


"HBsAg" 


hepatitis B surface antigen 


"HBV" 


hepatitis B virus 


"HCC" 


hepatocellular carcinoma 
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"PCR" polymerase chain reaction 
"RT" reverse transcriptase 

"SSC" standard saline citrate solution (0.1 5M saline 
containing 0.01 5M sodium citrate, pH 7) 

5 B. Definitions 

"Expression" means, with respect to a gene, the realization 
of genetic information encoded in the gene to produce a functional RNA or 
protein. The term is thus used in its broadest sense, unless indicated to 
the contrary, to include either transcription or translation. 

o "Expression level", with respect to a gene means a relative 

expression level as determined by comparison with the expression level of 
the gene in noncancerous tissue. An expression level may be "assessed" 
visually in a sample with the aid of a microscope, such as by considering 
the intensity of a stain for protein encoded by the gene of interest, or by 

s considering the relative number of stained versus unstained cells in the 
sample. 

"Hybridization" means the Watson-Crick base-pairing of 
essentially complementary nucleotide sequences (polymers of nucleic 
acids) to form a double-stranded molecule, 
o "Marker gene" means a gene which is differentially 

expressed in HCC tumor versus non-tumor tissue. 

"Marker protein" means a protein which is encoded by a 

marker gene. 

Detailed Description of the Invention 

5 The present invention is a method for determining whether 

tissue from a biopsy represents HCC. based on detection of gene 
expression patterns in cells. Studies of tumor and non-tumor pairs from 
patients demonstrate the differential expression of certain genes in tumor 
versus non-tumor tissue. The genes are expressed in non-HCC tissue, but 
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expression is substantially reduced or undetectable in HCC tumor tissue. 
Thus, a reduction in expression of one or more of the marker genes in the 
tissue sample is diagnostic for the presence of HCC in the patient sample. 
At the individual cell level, one or more of the marker genes may not be 
5 expressed or may be characterized by reduced expression level, such that 
the expression of the gene in the tissue sample as a whole is reduced. The 
identification of such molecular markers provides a method for diagnosing 
HCC without relying on tissue morphology alone. This is the first time that 
molecular markers associated with chronic HBV infection have been shown 

io to be useful in the diagnosis of HCC. 

The diagnostic marker genes were identified by manipulation 
of HepG2X cells. HepG2 is a differentiated ceil line derived from a human 
hepatoblastoma. The cell line HepG2X was generated by infection of 
HepG2 cells by replication defective recombinant retroviruses encoding the 

15 full length HBxAg polypeptide. HepG2CAT cells were generated in the 
same manner by substituting the bacterial CAT gene for the HBV X gene 
in the transfection vector. The HepG2X cells express the HBV X antigen 
(HBxAg[+]) f while HepG2CAT cells do not (HBxAg[-]). 

Genes whose expression were either turned on or off in the 

20 presence of the hepatitis B x antigen (HBxAg) in HepG2 cells are identified 
by PCR select cDNA subtraction. Briefly, the method consists of isolating 
whole cell RNA from HBxAg [+] and [-] HepG2 cells. Methods and kits for 
performing PCR select cDNA subtraction are well-known and commercially 
available, e.g., from Clontech, Palo Alto, CA. The RNA from HepG2X cells 

25 is subtracted from those in HepG2 cells, providing RNAs expressed in 
HepG2X cells but not in HepG2 cells. The RNAs were then reverse 
transcribed into DNA and then PCR amplified using random primers In 
order to obtain RNAs expressed in HepG2, but not HepG2X cells, the 
opposite subtraction is carried out. These RT/PCR fragments were then 

3 o cloned and either partially or fully sequenced. 
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Accordingly, equivalent amounts of poly(A)* RNA were 
isolated from confluent cultures of HepG2X and HepG2CAT cells and 
subjected to PCR select cDNA subtraction. DNA strands were individually 
sequenced from every clone, and the results for each compared to entries 
5 in GenBank and other related databases (Table 1 , below). The PCR select 
cDNA subtraction generated gene fragments from up to eight different 
cellular genes that were detected in HepG2X cells but not in HepG2CAT 
cells (L4, L7, L8 r L11, L12, L15, L16and L19). Five of these (L7, L8 ? L12. 
L1 6 and L1 9) had at least 89% homology with fragments of known products 

10 from GenBank (Table 1). Interestingly, three of the five sequences (L7, 
L12, and L19) had homology with factors upregulated in fetal tissues, 
suggesting that they may have some growth regulatory functions. In 
addition, two fragments (C1 and C2) were apparently present in 
HepG2CAT cells but absent in HepG2X cells. Hence, up to ten genes 

15 were differentially expressed in HepG2X compared to HepG2CAT cells. 
In the case of the transcripts hybridizing to the L fragments, the clones 
represent fragments of genes whose expression is activated in HBxAg[+] 
cells compared to HBxAg[-] cells. In the case of transcripts hybridizing to 
the C fragments, the clones represent genes whose expression is 

20 suppressed in HBxAg[+] cells compared to HBxAg[-] cells. The fragment 
size given in Table 1 is considered approximate, as size was estimated 
visually from gels. 
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Table 1: Differentially expressed genes in HBxAg[+] and [-] Hep 
G2 cells 



clone insert GenBank Search 



size %homology 
("bp) Match (and Accession #) in overlap 



HBxM B Clones 

L7 690 human fetal liver cDNA clone (H49417) 95.7% in 440 bp 

L8 b 220 human tubulin-folding cofactor E cDNA 1 00% in 45 bp 

10 (U61232) 

L12 320 human 40S ribosomal protein 
S15A(P48149) 

L16 b 180 human myeloblast KIAA01 32 gene 99% in 65 bp 

L1 9 250 human fetal heart cDNA (AA026758) 99% in 1 52 bp 

15 L4 h 1700 none 

L11 580 none 
L15 1580 none 



HBxf-1 a Clones 

C2" 620 human suil (L26247) 
2C C1 670 none 



The clones represent fragments of genes whose expression is activated 
(L4,L7,L8,L1 1 ,L12,L15,L1 6,L1 9) or suppressed (C1 ,C2) in HBxAg[+] compared to HBxAgf-] 
cells 

b Probes whose sequences share considerable homology with sequences independently 
found in tumor compared to nontumor cells. 



The cDNA fragments obtained from subtraction hybridization (Table 
1 ) were used as probes for ISH of HepG2X and HepG2CAT cells, to verify 
that the probes obtained from PCR select cDNA subtraction actually 
represented differentially expressed genes in HepG2 compared to HepG2X 
cells. In all cases, the L probes hybridized to HepG2X cells. Little or no 
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signal was observed in HepG2CAT cells. In contrast, the C probes 
demonstrated strong hybridization in HepG2CAT cells, but little or no signal 
in HepG2X cells. Thus, in all cases, the probes obtained from PCR select 
cDNA subtraction actually reflected differences in gene expression 
between HepG2X and HepG2CAT cells. 

The cDNA fragments were either partially (L4, L7, L8, L15, L16and 
C2) or completely (L1 1 , L1 2. L19 and C1 ) sequenced. The sequences are 
as follows: 



Table 2: Nucleotide Sequences of Fragments of Differentially 
Expressed cDNA 

cDNA Fragment Fragment Nucleotide Sequence 

L7 SEQ ID IMO:1 

L8 SEQIDNO:2 

L12 SEQIDNO:3 

L16 SEQ IDNO:4 

L19 SEQ IDNO:5 

L4 SEQ ID NO:6 

" L11 SEQ IDNO:7 

L15 SEQ IDNO:8 

C2 SEQ ID NO:9 

C1 SEQ IDNO:10 



In order to further study the structure and function of the protein 
encoded by the C2 mRNA, the full length cDNA containing the C2 
sequence was obtained (from HepG2CAT cells) by 5* and 3' rapid 
amplification of cDNA ends (RACE) PCR using the Marathon™ cDNA 
Amplification Kit (Clontech, Palo Alto, CA). Briefly, one 3' and one 5' gene 
specific primers were synthesized. PCR was performed using these 
primers together with an adaptor primer to obtain the 3' or 5 r cDNA specific 
products in separate amplification reactions. The products were cloned 
into pT7Blue T (Novagen, Inc., Madison, Wl) and sequenced. The 
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appropriate 3' and 5' gene specific fragments were then digested with 
suitable restriction enzymes and cloned into pcDNA3 (Invitrogen, San 
Diego, CA) at the chosen site(s), and the integrity of the full length clone 
verified by DNA sequencing. This resulted in a full length clone exactly 
1.35 kb in length, which encoded a small protein of 113 amino acids near 
its 5* end that has 100% homology with the human translation initiation 
factor, hu-suil. The C2 probe spans bases 903-1350 of full length hu-suil 
cDNA 

Other than its regulatory role in translation initiation, the human suil 
protein does not appear to have any recognizable motifs which would 
suggest additional functions. These results indicate that the introduction 
of HBxAg results in the altered expression of a protein whose function is 
associated with the regulation of translation. Further, HBxAg may 
contribute to hepato-carcinogenesis, in part, by altering gene expression 
at the level of translation initiation. 

Additional full length cDNAs from differentially expressed genes 
containing fragments L4, L7, L11 and L12 were obtained in a similar 
manner to fragment C2. The cDNA containing fragment L12 encoded a 
protein of 130 amino acids having a 100% homology with the human 40S 
ribosomal protein S15A (Accession nos. P39027, P39031). Sequences of 
the full length cDNAs and corresponding gene names are set forth in Table 
3: 
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Table 3: Nucleotide Sequences of Full-length Differentially 
Expressed cDNAs 



cDNA 


Full Length 


Gene Name 


GenBank 


Translated 


Fragment 


cDNA Seq. 




Accession 


Protein Seq. 




(SEQ ID NO:) 




Number 


(SEQ ID NO:) 


14 


1 1 


unknown 




27 


L7 


12 


human feta! 
liver cDNA 


H49417 


28 


L11 


13 


unknown 




29 


L12 


14 


human 40S 
ribosomal 
protein 
S15A 


P48149 


30 


C2 


15 


bu-suil 


L26247 


31 



!0 Experiments were performed to detect hu-sui1 transcripts in tumor 

and nontumor tissues from HBV infected patients. A panel of 
tumor/nontumor tissue pairs from a group of patients were analyzed by ISH 
using the C2 probe. Among this group, 14 patients were from South Africa, 
while the remaining 23 were from mainland China. The results (Table 5, 

15 Example 3, below) show that hu-suil mRNA is easily detectable in 
nontumor tissue from both groups, but that it is rarely present in tumor 
tissues from the same patients Thus, hu-sui1 is differentially expressed 
in tumor vs. non-tumor tissue. 

ISH was performed with the full set often individual probes (L4, L7, 

20 L8, L11, L12, L15, L16, L19, CI and C2) on tumor/nontumor paired 
samples from five HBV carriers with HCC, and on normal uninfected liver 
from two individuals The probes detected transcripts that were 
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preferentially expressed in nontumor, compared to tumor tissues, in most 
cases (Table 6, Example 4, below). These results were not due to 
differences in the ability of the tumor tissue to uptake probe, since tumor 
cells from three of the five HCC patients hybridized strongly to an alpha 
5 fetoprotein probe. 

For diagnosis of HCC, a sample of liver tissue is removed from an 
individual by conventional biopsy techniques which are well-known to those 
skilled in the art. Typically, the test subject will be an HBV-infected 
individual. The sample is generally collected by needle biopsy. 

10 Procedures for liver needle biopsy are well-known in medicine. A mass 
may be apparent from either tactile examination of the patient, or upon 
imaging such as by ultrasound. The needle biopsy should be taken at or 
near the site of the mass. Ultrasound guided percutaneous fine-needle 
biopsy procedures are known. See, e.g., Polakow, 

15 Hepatogastroenterology 45:1 829-30 (1998). The biopsy sample may also 
be taken in connection with a surgical procedure in which the liver becomes 
accessible. 

The expression level of the marker gene may serve as a convenient 
molecular marker to replace or augment conventional liver tissue 

2 0 examination, which largely relies on subjective criteria. This form of 
"molecular-based" diagnosis can be performed more consistently than 
conventional pathological examination which is based upon subjective 
evaluations by expert pathologists. 

Detecting the expression of the marker gene in the tissue sample 

25 comprises detecting RNA transcripts, particularly mRNA transcripts in the 
sample tissue, or detecting the corresponding marker gene product 
(protein) in the sample tissue. Preferably, the presence of the marker 
protein in the sample tissue is detected by an immunoassay whereby an 
antibody which binds the marker protein is contacted with the sampled 

30 tissue. 
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Typically, a portion of noncancerous liver tissue will be removed with 
the purported tumor tissue during biopsy. The noncancerous (i.e., non- 
tumor) cells will express the marker gene, and will provide a positive signal 
for the absence of HCC. Hence, the noncancerous cells in the biopsied 
5 sample will serve as a convenient positive control. 

It may be desirable in some cases to compare the assay results 
against a control sample comprising liver cells from non-tumor liver tissue 
from the test subject, or non-tumor liver tissue from another (HBV-infected) 
individual. The non-tumor sample should test positive for the expression 
10 of the marker gene. 

Tissue samples may be considered as HCC-positive when the level 
of expression of one or more marker genes is reduced in the tissue sample 
as a whole, compared to the expression level in noncancerous liver tissue. 
The overall reduction of marker gene expression in the liver sample may 
15 arise from a reduced but still detectable expression level in at least a 
portion of the cells of the sample, a complete loss of marker gene 
expression in some cells, or a combination of both. The former may be 
apparent as a general lessening of stain intensity when the sample is 
treated with a stain for cells which express the marker gene. The latter 
20 may be observed as a complete absence of stain in the affected cells. 
These observations can be made visually with the aid of a microscope. 

Methods of detecting mRNA transcripts of a particular gene in cells 
of a tissue of interest are well-known to those skilled in the art. According 
to one such method, total cellular RNA is purified from the effected cells by 

2 5 homogenization in the presence of nucleic acid extraction buffer, followed 

by centrifugation. Nucleic acids are precipitated, and DNA is removed by 
treatment with DNase and precipitation. The RNA molecules are then 
separated by gel electrophoresis on agarose gels according to standard 
techniques, and transferred to nitrocellulose filters by, e.g., the so-called 

3 0 "Northern" blotting technique. The RNA is immobilized on the filters by 
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heating. Detection, and quantification if desired, of specific RNA is 
accomplished using appropriately labeled DNA or RNA probes 
complementary to the RNA in question. See Molecular Cloning: A 
Laboratory Manual, J. Sambrook ef a/., eds., 2nd edition, Cold Spring 
Harbor Laboratory Press, 1989, Chapter 7, the disclosure of which is 
incorporated by reference. 

More preferably, the mRNA assay is carried out according to ISH. 
Also known as xytologicai hybridization", the in situ technique involves 
depositing whole cells or tissues onto a microscope cover slip and probing 
the nucleic acid content of the cell with a solution containing radioactive or 
otherwise labeled cDNA or cRN A probes. The practice of ISH is described 
in more detail in U.S. Patent 5,427,916, the entire disclosure of which is 
incorporated herein by reference. 

The nucleic acid probes for the above RNA hybridization methods 
can be designed based upon the full length marker gene sequences 
described or referenced herein. Where the marker gene has yet to be 
identified with a known, full-length sequenced DNA, the corresponding 
cDNA fragment listed in Table 2 may be used as the probe. 

Methods for preparation of labeled DNA and RNA probes, and the 
conditions for hybridization thereof to target nucleotide sequences, are 
described in Molecular Cloning, supra, Chapters 10 and 11, incorporated 
herein by reference. The nucleic acid probe may be labeled with, e.g., a 
radionuclide such as 32 P, 14 C, or 35 S; a heavy metal; or a ligand capable of 
functioning as a specific binding pair member for a labeled ligand, such as 
a labeled antibody, a fluorescent molecule, a chemolescent molecule, an 
enzyme or the like. 

Probes may be labeled to high specific activity by either the nick 
translation method or Rigby et a/., J. MoL Biol. 113: 237-251 (1977) or by 
the random priming method, Fienberg ef a/., Anal Biochem. 132: 6-13 
(1983). The latter is the method of choice for synthesizing 32 P-iabeled 
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probes of high specific activity from single-stranded DNA or from RNA 
templates. Both methods are well-known to those skilled in the art and will 
not be repeated herein. By replacing preexisting nucleotides with highly 
radioactive nucleotides, it is possible to prepare 32 P-labeled DNA probes 
with a specific activity well in excess of 1 0 8 cpm/microgram according to the 
nick translation method. Autoradiographic detection of hybridization may 
then be performed by exposing filters on photographic film. 

Where radionuclide labeling is not practical, the random-primer 
method may be used to incorporate the TTP analogue 5-(N-(N-biotinyl- 
epsilon-aminocaproyl)-3-aminoallyl)deoxyuridine triphosphate into the 
probe molecule. The thus biotinylated probe oligonucleotide can be 
detected by reaction with biotin binding proteins such as avidin, 
streptavidin, or anti-biotin antibodies coupled with fluorescent dyes or 
enzymes producing color reactions. 

In situ hybridization is most conveniently carried out using a 
commercially available kit for labeling nucleic acid probes with, e.g. 
digoxigenenin/biotin as a label. One such kit is available from Oncor, 
Gaithersburg, MD. 

According to another embodiment of the invention, marker gene 
expression in cells of the patient tissue is determined by detecting the 
corresponding marker protein. A variety of methods for detecting and 
quantifying expression of proteins of interest exist, including Western 
blotting and immunohistochemical staining. The latter is preferred. 
Western blots are run by spreading a protein sample on a gel, using an 
SDS gel, blotting the gel with a cellulose nitrate filter, and probing the filters 
with labeled antibodies. With immunohistochemical staining techniques, 
a tissue sample is prepared, typically by dehydration and fixation, followed 
by reaction with labeled antibodies specific for the desired gene product. 
The antibodies may be coupled to a visually detectable label, such as 
enzymatic labels, flourescent labels, luminescent labels, and the like. 



WO 99/39200 



- 18- 



PO7US99/01894 



According to one embodiment of the invention, tissue samples are 
obtained from patients and the samples are embedded and serially 
sectioned at 3-5 (j per section. The sections are fixed, mounted and dried 
according to conventional tissue mounting techniques. The fixing agent 
may advantageously comprise formalin. The embedding agent for 
mounting the specimen may comprise, e.g. , paraffin. The samples may be 
stored in this condition 

Following deparaffinization and rehydration, the samples are 
contacted with an immunoreagent comprising an antibody specific for a 
marker protein of interest. The antibody may comprise a polyclonal or 
monoclonal antibody. The antibody may comprise an intact antibody, or 
fragments thereof capable of specifically binding marker protein. Such 
fragments include, but are not limited to, Fab and F(ab') 2 fragments. As 
used herein, the term "antibody" includes both polyclonal and monoclonal 
antibodies, The term "antibody" means not only intact antibody molecules, 
but also includes fragments thereof which retain antigen binding ability. 

Appropriate polyclonal antisera may be prepared by immunizing 
appropriate host animals with marker protein and collecting and purifying 
the antisera according to conventional techniques known to those skilled 
in the art. Monoclonal antibody may be prepared by following the classical 
technique of Kohler and Milstein, Nature 254:493-497 (1975), as further 
elaborated in later works such as Monoclonal Antibodies, Hybridomas: A 
New Dimension in Biological Analysis, R. H. Kennet et a/., eds., Plenum 
Press, New York and London (1980). 

Substantially pure marker protein for use as an immunogen for 
raising polyclonal or monoclonal antibodies may be conveniently prepared 
by recombinant DNA methods. 

As an alternative to immunization with the complete marker protein, 
antibody against marker proteins can be raised by immunizing appropriate 
hosts with immunogenic fragments of the whole protein, particularly 
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peptides corresponding to probable antigenic determinants. Hydrophilic 
regions, which face the environment surrounding the protein, are most 
likely to contain antigenic sites. Such regions can be identified using 
standard computer programs. 

The antibody either directly or indirectly bears a detectable label. 
The detectable label may be directly attached to the primary anti-marker 
protein antibody. More conveniently, the detectable label is attached to a 
secondary antibody, e.g., goat anti-rabbit IgG, which binds the primary 
antibody. The label may advantageously comprise, for example, a 
radionuclide in the case of a radioimmunoassay; a fluorescent moiety in the 
case of an irmmunofluorescent assay; a chemiluminescent moiety in the 
case of a chemiluminescent assay; or an enzyme which cleaves a 
chromogenic substrate, in the case of an enzyme-linked immunosorbent 
assay. 

Most preferably, the detectable label comprises an avidin-biotin- 
peroxidase complex (ABC) which has surplus biotin-binding capacity. See 
Hsu et a/., J. Histochem. Cytochem. 29:577-580, 1981. The secondary 
antibody is biotinylated. Kits for staining proteins by the ABC method are 
commercially available (e.g., Vector Laboratories, Burlingame, CA). 

To determine the presence of marker protein antigen in the tissue 
section under analysis, the section is treated with primary antiserum 
against the antigen, washed, and then treated with the secondary 
antiserum. The subsequent addition of ABC localizes peroxidase at the 
site of the specific antigen, since the ABC adheres non-specifically to 
biotin. Peroxidase (and hence antigen) is detected by incubating the 
section with e.g. H 2 0 2 and diaminobenzidine (which results in the antigenic 
site being stained brown) or H 2 0 ? and 4-chloro-1-naphthol (resulting in a 
blue stain). 
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The ABC method can be used for paraffin-embedded sections, 
frozen sections, and smears. Endogenous (tissue or cell) peroxidase may 
be quenched e.g. with H 2 0 2 in methanol. 

The level of marker protein expression in samples may be compared 
5 on a relative basis to the expression in non-tumor liver tissue samples by 
comparing the stain intensities, or comparing the number of stained cells. 
The lower the stain intensity in the test sample with respect to nontumor 
controls, or the lower the stained cell count in a tissue section having 
approximately the same number of cells, the lower the expression of the 
10 marker gene in the sample, which indicates the presence of HCC in the 
sample. If a control is utilized, it advantageously comprises non-tumor 
liver tissue from another HBV-positive individual or non-tumor liver tissue 
from the patient. 

As a further control of the protein expression, one may preincubate 
15 the immune serum raised against the marker protein antigen with the 
relevant peptide antigen. This should dissipate the signal from the immune 
serum when contacted with healthy or non-tumor cells, confirming that the 
immune serum reagent is indeed specific for the target antigen. 

The diagnostic procedure described herein may take the form of 
2 o detecting the expression of just one of the marker genes. Alternatively, a 
mixture of probes (nucleic acid or antibody) targeting different marker 
genes may be utilized. This may be achieved by pooling two or more 
nucleic acid probes in the case of a nucleic acid hybridization assay, or by 
pooling two or more antisera in the case of a protein assay. By testing for 
2 5 the expression of multiple marker genes in this manner, it is expected that 
the sensitivity of the assay will be increased. 

The following nonlimiting examples are provided to better illustrate 
the claimed invention. 
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Example 1 

Preparation of HBxAg[+] and HBxAg[-] Cell Lines 

A. Cell Lines and Culture Conditions 

HepG2 cells, a differentiated cell line derived from a human 
5 hepatoblastoma (Aden, D. P. et al. 1979. Nature 282:615-617; Knowles, 

B. B. et al. 1980. Science 209:497-499), were cultured on type-1 rat tail 
collagen (Becton Dickinson, Franklin Lakes, NJ) coated tissue culture 
dishes or plates. Cells were grown in Earle's MEM supplemented with 10% 
heat inactivated fetal calf serum (FCS), 100 pM MEM non-essential amino 

io acids, 1 mM sodium pyruvate, as well as standard concentrations of 
penicillin plus streptomycin. The retrovirus packaging cell line PA317 
(Danos, O. 1991. Methods in Molecular Biology, Practical Molecular 
Virology: Viral Vectors for Gene Expression 8:17-27) was also grown on 
plastic dishes in the same medium. 



15 B. Plasmid Construction 

The retroviral vector plasmid, pSLXCMVneo, was used to clone the 
HBV X gene (Valenzeula, P. et al. 1 980. Animal Virus Genetics, Academic 
Press: New York, pp. 57-70) or the bacterial chloramphenicol 
acetyltransferase (CAT) gene sequences for these studies, as described 

20 (Duan, LX. etal. 1995. HumanGene Then 6:561-573). Briefly, pSLXCMV- 
CAT was constructed by inserting a 726 bp Hindlll-BamHI fragment 
containing the CAT gene into the Hpal-Bglll site of the pSLXCMV 
polylinker. PSLXCMV-FLAG-HBx was constructed by inserting a 920 bp 
Mlul-Bglll fragment of FLAG-HBx DNA into the Mlul-Bglll site of the pSLX- 

25 CMV polylinker. Recombinants were used to transform HB1 01. Minipreps 
were prepared and the DNA used for sequence analysis. 
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C. Preparation of Recombinant Retroviruses 
and Infection of HepG2 Cells 

Approximately 1 x 10 6 PA317 cells/100 mm dish were transfected 
using standard calcium phosphate precipitation using 15 pg of pSLXCMV- 
5 FLAG-HBx or 15 pg of pSLXCMV-CAT. At 24, 48, and 72 hours after 
transfection, the medium was removed and processed through a 0.45 pm 
filter to remove PA317 cells, and then used immediately for infection of 
hepG2 cells. Five ml of recombinant retrovirus-enriched supernatant (5 x 
10 6 CFU/ml, as assayed on NIH-3T3 cells) was used to infect 1 x 10 5 target 

ic HepG2 cells/100 mm dish in the presence of polybrene (8 pg/ml) for 24 
hours. Fresh virus supernatant was added after 24 and again after 48 
hours so that the cells were exposed to virus for a total of 72 hours. All of 
these infections were carried out in log phase cultures. Cells were then 
passaged at 1:2 and selected by incubation in G418 (800 pg/ml; 

15 GIBCO/BRL, Grand Island, NY) for 14 days in order to maximize the 
fraction of cells producing HBxAg or CAT. G418 colonies were then 
expanded in normal growth medium and used for analysis. The fourteen 
day selection in G418 had the effect of eliminating most of the uninfected 
cells. 



20 D - Detection of CAT Activity and HBxAg Polypeptide in Transfectants 
The transfectants (HepG2-CAT and HepG2X) were evaluated as 
follows. 

CAT assays were performed as described by Wang et al. (1994. 
Proc. Natl. Acad. ScL USA 91:2230-2234). Briefly, 1 x 10 7 HepG2-CAT 
2 5 cells in a 100 mm dish were lysed by addition of 0.9 ml of 1x report lysis 
buffer (Promega) for 15 minutes and harvested by scraping. Cells were 
pelleted and 180 pi of cell lysate was used for a standard CAT assay. After 
incubation with ^-chloramphenicol, acetyiated forms were separated by 
thin-iayer chromatography. Alternatively, lysates prepared from 5 x 10 6 
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HepG2X cells were assayed for the 17 kDa HBxAg by western blotting 
using a mixture well characterized rabbit anti-x peptide antibodies 
(Feitelson, M.A. and MM. Clayton. 1990. Virology 177:367-371; Feitelson, 
M.A.etal. 1990. Gastroenterology98: 1071 -1078). Horseradish peroxidase 
conjugated goat anti-rabbit Ig (Accurate, Westbury, NY) and ECL substrate 
(Amersham, Arlington Heights, IL) were used for detection. 

CAT activity was present in HepG2CAT, but not in HepG2X cells. 
HBxAg was present in lysates from HepG2X f but not from HepG2CAT 
cells. Together, these findings show that both of the recombinant 
retroviruses are expressing the expected products in HepG2 cells. 



Example 2 

Identification of Differentially Expressed Genes 
15 Distinguishing HepG2X from HepG2CAT 



The differences in gene expression which distinguish HepG2X from 
HepG2CAT cells were determined by using a commercially available 
subtraction hybridization approach (the PCR-select cDNA subtraction kit 
from Clontech, Palo Alto, CA). Briefly, whole cell RNA was extracted 
separately from 1 x 10 7 HepG2X and an equal number of HepG2CAT cells, 
and the quality of the extraction was determined by assaying for 18S and 
28S rRNAs by agarose gel electrophoresis and ethidium bromide staining. 
PCR-select cDNA subtraction is reverse transcriptase (RT)/PCR based, 
and enriches for poly A* RNA (isolated using the Qiagen RNeasy total RNA 
kit; QIAGEN, Inc., Chatsworth, CA) from tissue culture cells or tissues. The 
procedure involved ligating adaptors to some of the PCR products and 
conducting two rounds of subtractive hybridization against the PCR 
products from the cells in which the comparison were being made. The 
resulting products were then PCR amplified using primers which matched 
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the sequence of the adaptors (in the CLONTECH Advantage cDNA PCR 
kit). The unique fragments were then eluted from the gels (using the 
QIAGEN gel extraction kit) and cloned into pT7Blue (Novagen, Madison, 
Wl). Positive clones were selected by blue-white phenotype. Recombinant 
5 DNAs were isolated from minipreps of individual clones, digested by Rsa 
I to check insert size, and then both strands individually analyzed by 
sequence analysis. The sequences obtained were then compared to those 
in GenBank using the FASTA command in the GCG software package for 
homology to known genes. The results are set forth in Table 1, above. 



10 Example 3 

Detection of Hu-suil Differential Expression in Patient 
Tumor and Non-tumor Tissue by In situ Hybridization 



These experiments were performed to detect hu-sui1 transcripts in 
a tumor and nontumor tissues from HBV infected patients. Accordingly, a 
panel of tumor/nontumor tissue pairs from a group of HCC patients was 
analyzed by ISH using the C2 probe. 

The HCC and surrounding nontumor liver tissues were obtained 
from two different sets of HCC patients. The characteristic of the patients 
are set forth in Table 4: 
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Table 4: Characteristics of HCC patients in study 



Patient group African Chinese 

5 Number tested 14 — — — 

Race 13 black, 1 Caucasian 23 Chinese 
Gender 14 male, 0 female 17 male, 6 female 

Age range: 14-72 years 31-68 years 

mean: 39 years 48 years 

1 0 No HBsAg* of total tested 8 of 14 tested (57%) 9 of 23 tested (39%) 

No HBeAg* of total tested 2 of 1 3 tested (15%) 14 of 23 tested (61%) 

No anti-HBc + of total tested 13 of 14 tested (93%) 19 of 23 tested (83%) 

No. anti-HBc* of total tested 2 of 13 tested (15%) 10 of 23 tested (43%) 

No anti-HBs* of HBs tested 5 of 6 HbsAg [-] cases 6 of 14 HbsAg [-] cases 

1 b In Table 4, HBsAg and HBcAg are, respectively, hepatitis B surface 

antigen and core antigen. HBeAg is hepatitis B e antigen, a proteolytic 
fragment of HBcAg which is secreted as a free polypeptide into the blood 
of patients who replicate virus in the liver. HBeAg has thus been described 
as a surrogate marker for virus replication 

2 Q Twenty-three paired tumor/nontumor samples came from as many 

HBsAg positive Chinese carriers who had undergone surgery for the 
removal of their tumors. Most patients lived in and around Xi'an, China and 
were treated at the Fourth Military Medical University. Fourteen additional 
paired tumor/nontumor samples from as many patients were obtained from 
25 South African patients. Half of these were HBV carriers (serum HBsAg 
positive) while the remaining patients, except for one, had evidence of past 
HBV infection (detectable anti-HBs and/or anti-HBc). Formalin fixed, 
paraffin embedded tissues, fresh frozen blocks, and -80°C snap frozen 
paired liver and tumor samples from individual patients were collected from 

3 o most patients, used for diagnostic purposes, and were then made available 

for these studies. Analogous pieces of uninfected human liver from two 
individuals were available to serve as controls. 

Gene fragment C2 obtained from PCR select cDNA subtraction was 
used as a probe in in situ hybridization using the Oncor ISH and 
35 digoxigenenin/biotin detection kits according to the instructions provided by 
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the manufacturer (Oncor, Gaithersburg, MD). The results, shown in Table 
5, demonstrate that hu-sui1 mRNA is easily detectable in nontumor tissue 
from both groups, but that it is rarely present in tumor tissues from the 
same patients. For example, 1 3 of 14 South African patients (93%) and 22 
5 of 23 Chinese patients (96%) had detectable hu-sui1 mRNA by ISH in 
nontumor cells. In contrast, only 1 South African (7%) and 5 Chinese 
(22%) had detectable hu-suil mRNA by ISH in tumor tissue Among the 
Chinese patients with detectable hu-suil in HCC, 3 of the 5 had only trace 
amounts of signal in less than 10% of the tumor cells. Nontumor tissue 
io signals were often more intense and more widespread. These patterns 
were observed in both HBsAg positive and negative patients with HCC in 
both ethnic groups. These results demonstrate that hu-suil is differentially 
expressed in tumor compared to nontumor tissue. 
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Table 5: Summary of in situ hybridization for C2 probe in tumor 
/nontumor pairs for HCC patients from south Africa and 
China* 

South Africa patients 



5 


case no: 


1 


2 


3 


4 


5 


6 


7 






tumor 


0 


0 


0 


0 


2 


0 


0 






nontumor 


3 


3 


3 


3 


4 


3 


3 






case no: 


8 


9 


10 


11 


12 


13 


14 






tumor 


0 


0 


0 


0 


0 


0 


0 




10 


nontumor 


3 


2 


4 


3 


2 


0 


3 






Chinese patients 


















case no: 
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8 




tumor 
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1 


0 


0 


0 


0 


1 




nontumor 


3 


2 


4 


3 


3 


4 


3 


3 


15 


case no: 
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10 


11 


12 


13 


14 


15 


1C 




tumor 


0 
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0 


0 


1 


0 


0 


0 




nontumor 


4 


0 


4 


4 


3 


3 


3 


4 




case no: 


17 


18 


19 


20 


21 


22 


23 






tumor 


0 


0 


2 


0 


0 


0 


0 




20 


nontumor 


3 


3 


3 


3 


4 


4 


4 





a ln situ hybridization (ISH) staining is estimated as follows: 0: no signal; 1: 
ISH signal in <10% of cells; 2: ISH signal in 10-25% of cells; 3: ISH signal 
in 25-50% of cells; 4: ISH signal in >50% of cells. 



25 Example 4 

Detection of Differential Expression in Patient 
Tumor and Non-tumor Tissue by In situ Hybridization 

The HCC and surrounding nontumor liver tissues used for analysis 
were obtained from five HBsAg positive Chinese carriers who had 
3 0 undergone surgery for the removal of their tumors. These patients were 
treated at the Fourth Military Medical University, Xi'an, China. Formalin 
fixed, paraffin embedded tissues fresh frozen blocks, and -80°C snap 
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frozen paired liver and tumor samples from individual patients were 
collected, used for diagnostic purposes, and were then made available for 
these studies. In many cases, tumor was dissected from nontumor just prior 
to snap freezing. Analogous pieces of uninfected human liver from two 
5 individuals were available to serve as controls. 

In situ hybridization was carried out using probes L4, L7, L8, L11, 
L12, L15, L16, L19. C1 and C2, and the Oncor ISH and 
digoxigenenin/biotin detection Kits according to the instructions provided by 
the manufacturer (Oncor, Gaithersburg, MD). The results are shown in 

10 Table 6. The probes detected transcripts that were preferentially 
expressed in nontumor. compared to tumor tissues, in most cases. These 
results were not due to differences in the ability of the tumor tissue to 
uptake probe, since tumor cells from three of the five HCC patients 
hybridized strongly to a 320 bp alphafetoprotein probe (data not shown). 

IB Hence, the probes that distinguish HepG2X from HepG2CAT cells also 
distinguish tumor from nontumor in carriers with HCC. 
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Table 6: In situ hybridization results from PCR select cDNA 
amplification 



clone hybridization" in tumor/nontumor (T/NT) in patient: uninfected 

HepG2 HepG2 1 2 3 4 5 liver 

-X -CAT TNT TNT TNT TNT TNT 



L7 


++ 


± 


4-/4-4- + 


+/+ + + 


-/++ + 


_/+ 


+/+ 


+ + 


L8 






-/ + + 


-/+ + 


-/■*- 


+/++ 


-/++ 


4 


M2 


+ T 


± 


-/+ 


4-/4-4- 


-/+ + 


-/+ + 


-/+ + 




L16 


+ 




±/+ + 


-/+ 


-/- 


±/+ 


+/+ + 


+ 


L19 


+ + 


+ 


-/+ + 


-/+ + 


+/+ + 


-/+ 


-/+ + 


± 


L4 


+ + 


+ 


-/+++ 


++/- 


4-4- + 


±/+ 


-/+ 


+ 


L11 


4 




-/+++ 


-/+ + + 


+/+ + 


-/+ 


-/+ + 


+ 


L15 


+ 4- 


+ 


+/+ + 


-/+ + 


+/+ + 


-/+ + 


-/+ + + 


+ 


C2 




+ + 


-/+ + 


-/+ 


-/+ + + 


-/+ 


-/+ 


+ + + 


C1 


± 


+ + 


-/++ 


-/+ + 


-/+++ 


+/+ + + 


+/+ + 


+ + + 



a ln situ hybridization (ISH) signals were as follows: - : no signal; ± : 1-10% of the cells were 
positive for the corresponding probe; + : 1 1-25% of the cells were positive; ++ : 26-50% of 
the cells were positive; +++ : >50% of the cells positive. In the great majority of cases, 
HbxAg was observed in nontumor liver. Tumor cells were either faintly positive for HbxAg 
or completely negative. 



Example 5 

Detection of Differential Expression in Patient 
Tumor and Non-tumor Tissue by Immunostaining 

A Peptides 

Synthetic peptides that represent probable antigenic determinants 
on each of the differentially expressed proteins were prepared by solid 
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phase peptide synthesis and analyzed by HPLC and amino acid 
composition prior to use. The peptides are identified in Table 7, below. 
The The peptides were coupled by virtue of their free cysteine sulfhydryl 
(either in the peptide sequence or added to the carboxy or amino terminus 
5 where the native sequence did not contain a cysteine) to keyhole limpet 
hemocyanin (KLH; Sigma) using the coupling agent m-maieimidobenzyol- 
N-hydroxysuccinimide ester (MBS; Pierce) as described by Liu ef a/., 
Biochemistry 18:690-697 (1979). 
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Table 7: Peptide fragments of differentially expressed proteins 



Gene Peptide Sequence Peptide 

Position 
in Protein 

C2 C2.1 DDYDKKKLVKAFKKKFAC 52-69 
(SEQ IDNO:16) 

C2.2 EHPEYGEVIQLQGDQRKNIC 75-94 
(SEQ ID NO:17) 



10 L4 L4A CQKAKDRMERITRKIKDSDAYRRDE 460-484 

(SEQ IDNO:18) 

L4B PRPRDKRQLLDPPGDLSRC 821-838 
(SEQIDNO:19) 



L7 L7A CGVWNQTEPEPAATS 12-25 
15 (SEQIDNO:20) 

L7B HHHGRGYLRMSPLFKC 56-70 
(SEQIDNO:21) 



L11 1L11 PCPELACPREEWRLGP 2-17 
20 (SEQIDNO:22) 

3L11 DPSRSPHSTSSFPRGSSATSCDSR 316-339 
(SEQ ID NO:23) 

4L11 HPPDGSFSTFHDGPQPLEDPC 359-378 
25 (SEQ ID NO:24) 



L12 L12.1 KSINNAEKRGKRC 12-23 
(SEQ ID NO:25) 



30 



L12.2 



DHEERRRKHTGGKC 
(SEQ ID NO:26) 



112-124 
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B. Antibody Production 

For antibody production, 5- to 10-week old female New Zealand 
White rabbits (2 animals/peptide; Hazelton) were bled and then injected 
with peptide conjugate as described (Bittle et al. , Nature 298:30-33, 1 982). 
5 Dilutions of immune sera were assayed in parallel with preimmune sera in 
solid-phase assays in wells (Immunolon 2 Removawell Strips, Dynatech 
Labs) coated with the appropriate (unconjugated) synthetic peptide. See 
Feitelson etai. Gastroenterology 98: 1071-1078, 1990 or Feiteison et al., 
J Med Virol 24: 1 21 -1 36, 1 988 for additional details of the solid phase assay 
io design. 

C. Immunohistochemical Staining 

Antisera generated from two or three peptide antigens of the same 
protein (Table 7) are pooled, and used in immunohistochemical staining 
assays as follows. Paired tumor/nontumor tissue samples from HBV- 

15 associated liver cancer HCC patients comprise the test samples. 

Tissues are fixed in 1 0% formalin, embedded in paraffin and serially 
sectioned at 5m per section. Sections are then stained for individual 
differentially expressed proteins by the avidin-biotin complex (ABC) method 
(Hsu et a/., J. Histochem Cytochem 29; 577-580) using a kit purchased 

2 0 from Vector Laboratories (Burlingame, CA). Staining is detected by 
addition of diaminobendizine (DAB) substrate, and the sections then 
counterstained with Mayer's hematoxylin. The degree of positive reaction 
is scored from 0 to +++. The grade 0 indicates no demonstrable antigen, 
+ mild, ++ moderate and +++ dark staining. 

25 D. Results - C2.1/C2.2 Antibody Mixture 

Staining with antisera comprising antibodies against the C2.1 and 
C2.2 antigens indicated the presence of the Sui1 protein by brown color in 
the cytoplasm of nontumor cells surrounding the HCC tumor tissue. Very 
little staining appeared in the tumor cells of the HCC tissue. 
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D. Controls 

The specificity of staining for differentially expressed protein was 
demonstrated by the following controls, (a) Several uninfected human liver 
samples, and several tissue types from other organs (spleen, lymph node, 
5 muscle, nerve, and gall bladder), were tested with immune sera, (b) 
Preimmune and normal rabbit sera were tested with positive liver sections, 
(c) The synthetic peptide(s) used to raise immune sera was tested for 
blocking of staining when preincubated with corresponding antisera prior 
to staining, (d) Liver powder made from uninfected human liver tissues was 
io used to absorb the primary antibodies prior to staining. Peptide antisera 
were tested by western blotting with E. coli lysate from bacteria expressing 
the corresponding L or C polypeptide compared to a similar lysate from 
untransfected host cells. The results of each of these control procedures 
supported the specificity of staining. 



is All references cited with respect to synthetic, preparative and 

analytical procedures are incorporated herein by reference. All sequence 
records identified by GenBank accession numbers are incorporated herein 
by reference. 

Theentire disclosure of U.S. provisional patent application Serial No, 
2 0 60/072,938 filed January 29, 1998, is incorporated herein by reference. 

The present invention may be embodied in other specific forms 
without departing from the spirit or essential attributes thereof and, 
accordingly, reference should be made to the appended claims, rather than 
to the foregoing specification, as indication the scope of the invention. 
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What is claimed is: 

1 . A method for detecting hepatocellular carcinoma in liver tissue 
of a patient comprising: 

assessing the level of expression of one or more marker 
genes in a liver tissue sample from the patient, which marker genes 
are differentially expressed in HBxAg[+] cells as compared with 
HBxAg[-] cells, a reduction in the level of expression of said one or 
more marker genes in the sample as compared to the expression 
level in noncancerous liver tissue being indicative of hepatocellular 
carcinoma in the sample. 

2. The method according to claim 1 wherein the one or more 
marker genes is selected from the group of genes expressing an RNA 
transcript which hybridizes under conditions of high stringency to a nucleic 
acid probe selected from the group consisting of SEQ ID NO:1. SEQ ID 
NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID 
NO:7, SEQ ID NO:8, SEQ ID NO:9, and SEQ ID NO:10. 

3. The method according to claim 1 wherein the marker gene is 
selected from the group consisting of 

a gene which encodes the polypeptide of SEQ ID NO:27; 
a gene which encodes the polypeptide of SEQ ID NO:28; 
a gene which encodes the polypeptide of SEQ ID NO:29; 
a gene which encodes the polypeptide of SEQ ID NO:29; 
hu-sa/; 

human tubulin-folding cofactor E gene; 
human myeloblast KIAA0132 gene; and 
the human fetal heart gene, the cDNA of which is identified 
as GenBank accession number AA047006. 
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4. The method according to claim 3 wherein the marker gene is 
hu -sui1. 

5. The method of claim 1 wherein the step of assessing the 
expression of said one or more marker genes comprises contacting said 
sample with one or more probes which detect mRNA which is differentially 
expressed in HBxAg[+] cells as compared with HBxAgf-] cells. 

6. The method of claim 5 wherein the mRNA detected hybridizes 
under high stringency conditions to a nucleic acid probe having a 
nucleotide sequence selected from the group consisting of SEQ ID NO:1 , 
SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, 
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, and SEQ ID NO:10. 

7. The method of claim 1 wherein the step of assessing the 
expression of said one or more marker genes comprises detecting marker 
protein encoded be said one or more marker genes. 

8. A method according to claim 7 wherein the one or more 
marker genes is selected from the group of genes expressing an RNA 
transcript which hybridizes under high stringency conditions to a nucleic 
acid probe selected from the group consisting of SEQ ID NO:1, SEQ ID 
NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO;6, SEQ ID 
NO:7 t SEQ ID NO:8, SEQ ID NO:9, and SEQ ID NO:10. 

9. The method according to claim 7 wherein the marker gene is 
selected from the group consisting of 

a gene which encodes the polypeptide of SEQ ID NO;27; 
a gene which encodes the polypeptide of SEQ ID NO:28; 
a gene which encodes the polypeptide of SEQ ID NO:29; 
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a gene which encodes the polypeptide of SEQ ID NO:29; 
hu-stv/; 

human tubulin-folding cofactor E gene; 
human myeloblast KIAA0132 gene; and 
5 the human fetal heart gene, the cDNA of which is identified 

as GenBank accession number AA047006. 

10. The method of claim 7 wherein the one or more marker 
proteins are detected by contacting the sample with one or more antibodies 
which bind said marker proteins. 

1 c 11. A method according to claim 1 0 wherein the marker gene is 

hu-suil. 

12. A method for diagnosing hepatocellular carcinoma 
comprising: 

obtaining a liver tissue sample from a patient; 
15 assessing the level of expression of one or more marker 

genes in the sample, which marker genes are differentially 
expressed in HBxAg[+] cells as compared with HBxAg[-] cells, a 
reduction in the level of expression of said one or more marker 
genes in the sample as compared to the expression level in 

2 0 noncancerous liver tissue being indicative of hepatocellular 

carcinoma in the sample. 

13. The method according to claim 12 wherein the one or more 
marker genes is selected from the group of genes expressing an RNA 
transcript which hybridizes to a nucleic acid probe selected from the group 

2b consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4. 
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SEQ ID NO:5. SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, 
and SEQ IDNO:10. 

14. The method according to claim 12 wherein the marker gene 
is selected from the group consisting of 

a gene which encodes the polypeptide of SEQ ID NO:27; 
a gene which encodes the polypeptide of SEQ ID NO:28; 
a gene which encodes the polypeptide of SEQ ID NO:29; 
a gene which encodes the polypeptide of SEQ ID NO:29; 
hu-su/; 

human tubulin-folding cofactor E gene; 
human myeloblast KIAA0132 gene; and 
the human fetal heart gene, the cDNA of which is identified 
as GenBank accession number AA047006. 

15. The method according to claim 14 wherein the marker gene 
is hu-sui1 . 

16. A method for identifying alterations in gene expression 
patterns in a biological sample that are induced by hepatitis B x antigen 
comprising the steps of 

obtaining a biological sample; 

contacting said sample with a probe which detects an mRNA 
which is differentially expressed in HBxAg[+] cells as compared with 
HBxAgf-] cells; and 

detecting expression of a gene encoding said mRNA detected 

by said probe 



1 7. The method of claim 1 6 wherein the biological sample is liver 

tissue. 
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18. The method of claim 1 7 wherein the liver tissue comprises a 
hepatocellular carcinoma. 

19. A method for identifying patterns in gene expression in a 
biological sample that are altered by hepatitis B x antigen comprising the 
steps of 

obtaining a biological sample; 

contacting said sample with an antibody which detects a 
protein which is differentially expressed in HBxAg[+] cells as compared with 
HBxAgH cells; and 

detecting expression of a gene encoding said protein 
detected by said antibody. 

20. The method of claim 1 9 wherein the biological sample is liver 

tissue. 

21 . The method of claim 20 wherein the liver tissue comprises a 
hepatocellular carcinoma. 
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SEQUENCE LISTING 



<1I0> Feitelscn, Mark 

<120> Molecular Markers for Diagnosing Hepatocellular 
Carcinoma 

<130> 36seq 

<140> Not Yet Assigned 
<141> 1999-01-15 

<15C> 60/072, 938 
<1S1> 1998-01-29 

<16C> 26 

<170> Patentln Ver . 2.0 

<210> 1 
<211> 488 
<212> DNA 

<213> Homo sapiens 
<400> 1 

tgtccccact cttcaaagcc aaga-:ggrag czgccaLccc tggggagcct ggaaccaggc 60 
adtgttcggg ggaggcaggg gacaggctgg aacctggtga agtcttaaag taaactcctc 120 
ctatcggggt gtagaaggga atc-gttaat caaacagagc aatattagaa aggctacaga 180 
ggtcaactca gtcgaacacg gt:cucccaa acagattttg taattccgaa aatccacgca 240 
tgcgcaaaca tacgcataca ctcccatgtt cctggacagt ttatagctac cataacctgg 300 
cattttccaa aacataccat gtagactctt ggatacacaa ggtaatttta gagccacatt 360 
acgatgaacc ttttaaaaag ttatcattta tttttatntt cccccactgg ctgtattata 420 
agacaatttt tatatgtgat argtatttac cttagtgtgt taaataaaca cnggcattcc 480 



<210> 2 

<211> 90 

<212> DNA 

<213> Homo sapiens 

<400> 2 

acctgcccgg gcggccgctc gagccctata gtgagtcgta ttagaggccc ttgtagcgtg 60 
aagacgacag aaagggcgtg gtgcggaggg 90 

<210> 3 
<211> 360 
<212> DNA 

<213> Homo sapiens 



ctaaaaaa 



488 
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<400> 3 

tgaaaaccaa actggcggga tggaagcaga ttattctgcc atttttccag gLclttgagt 60 

tgcacgtcaa atctggggct gatcacccca cacttgttta gcctgcctgt gaggttcaca 120 

acaattttcc cagctctgtg gtcatcaatg atttcaaatt cgccaatgta accatgcttc 180 

atcatcacag tgagaaaccg gacgatgact ttggagcacg gcctaataag cacctggcgt 240 

ttgcctctct tttcggcatt gttgatactc ttgagagcat ctgccaggac attcatgcgc 300 

accattgtgg cgccgcggaa aacctgcccg ggcggccgct cgaaatccat atgactagta 360 

<21C> 4 
<211> 90 
<212> DNA 

<213> Hone sapiens 

«- a o n a 
— « w - - - i 

acctgttgag gcacttttgt ttcttgggca aaaatacagt ccaatggaga gratcattgt 60 
ttttgtaccg ccctccccac cacgccctaa 90 

<210> 5 
<211> 199 
<212> DNA 

<213> Horr.o sapiens 
<400> 5 

tacgggaagg cgaagaaaag aatacaqaag atagggaaat tagaagataa aaacatacLt 60 
ttagaggaaa aaagataaat ttaaacctga aacgtaggaa gcagaagaaa aaagacaagc 120 
taggaaacaa aaagctaagg ccaaaatgta ctgaaaatca agatcaagcg agettttgee 180 
cttctgctcc aegggaggt 199 

<210> 6 
<21!> 658 
<212> DNA 

<213> Homo' saoiens 



<400> 6 

tcagggaaag ccttcacaga tcagtcaata 
ccacgggcac tcacatcttg aatgctggtc 
aggectaett ctgcttcctc aggacaactt 
cctgggcctg cagtgactaa ggaegctget 
ctatacagag ggcccacccc gcaggatcct 
cttgtctccc taccctgggg gctgtgatat 
ggtcacttcc tcttttaaca ctgttgagga 
tctcatgtcg attgggcacc agccattctc 
cccatcagac agggctgccc acagcagcct 
tcacagccgt ctcaccagct caatgagctg 
gttcctgatg ttttccagta ggcatctctt 

<210> 7 



aatacggtgc catgggatgt gccttgcaca 60 
cactggaggc ccttgggttg ggegggagea 120 
ccccacctct gtcctgggac cacctgcccg 180 
cccactccag gggccagtga cagagagcag 240 
tgacaggagc tgagacagaa caaactgetg 300 
tcttggtaac atctctgagc tggtctgtga 360 
gactccaaac cctctgtctt ctgctcgtct 420 
aggcaccaga gcacagcccc acacgggtgc 4 80 
cctacacctg aactgggttt ctctgcacac 540 
ctggatgttt ttgttttggt tcgacaagcc 600 
caattcaaat atggcttcac tgtaagee 65B 
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<2I2> DNA 

<213> Homo sapiens 



<40C> 7 

tcgagccggt accgggnncc gatttgtagc gtgaanacga cagaaaaggc gtggtgcgga 60 

gggccgtgta gcgtgaagac gacagaaagg gcgtggcgga gggcggtacc taccctttcc 120 

accctgacgg ggagtgctgc cccgtgtgcc gagactgcaa ctacgaggga aggaaggtgg 180 

cgaatggcca ggtgttcacc ttggatgatg aaccctgcac ccggtgcacg tgtcagctgg 240 

gaagaggtga gctgtgagaa ggttccctgc cagcgggcct gtgccgaccc tgccctgctt 300 

cctggggact gctgctcttc ctgtccagat tccctgtctc ctctggaaga aaagcagggg 360 

ctctcccctc acggaaatgt ggcattcagc aaagctggtc ggagcctgca tggaagacac 420 

tgaggcccct gtcaactgta gctcctgtcc tgggcccccg acagcatcac cctcgaaggc 480 

cqgtgcttca tctcctccag ctccttttaa gaar.gaactt gatgaaaaca cacactttac 540 

ctacaagccc ggcaggagct catggtccac actcactcg: ttLggggrrtg acagccactt 600 

<210> 6 
<211> 532 
<212> DNA 

<213> Homo sapiens 
<400> 0 

aagccccccc ctttcccttn tttttcttnc ccaanngggg natnccgqcc cnttnggntg 60 
aaccaan::a acnnatcccc naaanatggt nccccnntgg aataanccr.t gggttactnt 120 
ccaaaccaat tttaccccta tttactttaa atggaattaa ar.cctccctn tttcattttt 180 
aaagggatca nggigaaaat cccnttgnaa aaccccccna accaaaancc ccttaaant: 240 
nattitccc: tccccgggaa cUncnaacc cnctaaaaaa anaaanaanc gtttccncnn 300 
aaarncnttt tttccgcccn ctttatatcg ncgccttngn aaaananaaa aaancccccc 360 
nccccccggg tggtttttnt ncccggattt aaaaancccc nntctttttt ccaaaggttt 420 
ccggntnccc ccaaaacncc aanacccncn ntcggncccc ccttttttgc cccngntnng 480 
ggcccccncc naaaaatctt ntccccccnc cntcatcncn cgctngnnaa tn 532 

<210> 9 
<211> 425 
<212> DNA 

<213> Homo sapiens 



<400> 9 

gctttttttt tttttttttt tttttttttt 
aagggtttaa tacattacac ataacattaa 
gtttgttact tcacatggca ttgggcagct 
gctacatgac tgatggatca gtttgagatt 
gaaggttggc ctcacattct gatgtttgga 
cagacctttg tggcaagcca gatgtcctat 
agctctgtcc acctagtcag gttggaaaca 
aatag 

<210> 10 



tttatttctg aaaacaagtt rtatttaaat 60 

aactgaaggg gaaaaaaaaa ccaaaaacca 120 

gctgctatta agttgcaagc tctacagcta 180 

tgttcccttg tcaaaagttt aactctgata 240 

catcccttag ctaggatatg tctggtcgaa 300 

cacctcccta gcggtaagag ggcctctttg 360 

ccaggggatc taccacaaaa gctcccttct 420 

425 
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<212> DNA 

<213> Homo sapiens 



<400> 10 

cagncggtac cggggnnccg attgtagcgt 
ggcggttgta ncgtgaanac gacagaaacg 
cgggcggccg ctcgagccct atagtgagtc 
cttgtagcgt gaaaacnaca gaaagggcgt 
cagaaagggc gtggtgcgga gggcggttgt 
ctatagtgag tcgtattang ccctatagtg 
taggccctat- agtgagtcgt attaccccta 
gtattangcc ctatagtgag tcgtattacg 
tgagtcgtat tangccctat agtgagtcgt 
ctatagtgag tcclattagg ccttgtancg 
ttaaaaatcc atatgactag tnnatcctct 
cctatanttg agtcgtatta aaagcttggc 

<210> 11 
<211> 3607 
<212> DNA 

<213> Homo sapiens 



gaaagacgac acaaaaggcg tggtgcggan 60 
ggcgtggtgc ^cgcagtgga ggtacctgcc 120 
gtattacccc tatagtgagt cgtattaggc 180 
ggtgcggagg gcggtgtacc gtgaanacaa 240 
agcgtgaana cqacagaacc cgcncgagcc 300 
agtcgtatta ngccctatag tgagtcgtat 360 
tagtgagtcg tattagccct atagtgagtc 420 
ccctataatg agtcgtatta ngccctatag 480 
attagqccct atagtgagtc gtattangcc 540 
teaaaacadc: ^-cctatan ttgagtcgta 600 
agantcnacc tcacaggcat gcaaqctttc 660 
gta 693 



<403> 11 

cctcgctgcg ctcatggcgt cgcccgggca 
aaaagcatca gagagacgaa cagctgtggc 
aggagatgat tgcgagttcc gttatggaga 
tccaacagtg gagagaagca ggcttcaaga 
ccaggtccag aaactcagcc tccaggactc 
ctgggcccct caggttccca aagacttgcc 
caatgctgat gccaggaata ccactatggt 
ggagaaggag agccagatgg aagaggagat 
tgccgacatt tattcctttt ctgacctgcc 
ccttctctgt gccctgctgc tctcctcaga 
aatggccctc tgccagtttg cactcccact 
tacatttctg ctgtgggcca tgcggggcat 
gggcatgggg agcttccggg aagacagcgt 
cgtgcgcatg gacgtcagta gcaactccaa 
gggccacagg cagtgggact gcttctggca 
ggagatttcg gatgggttgg tagaaatttc 
ggacattttc ccagaacctg tggcctttct 
gctgcagttt aagctcttga cagaaatctc 
cagtaagaag gaatacaaat tgctgtactc 
catcctgagb ccctaccgtg ggaagcgcaa 
tcctgtgctg aaaatagacc actcacatgt 
cagcttcgtg aagaggatcc gggccatcgt 
ggtatctgtg gaggacatgg cgcacgcagc 
ctgtgaggag tgtcagaaag cgaaagaccg 



ttcagatttg gcagaagtag ccccagaaat 60 
cattgcagat ttggaatgga gagaaatgga 120 
tggtacaaat caggctcagg acaatgattt 180 
aatgctgtca cttttgggcc tagagacgta 240 
tctgcagatc agttttgaca gta^gaagaa 300 
ctggaatttc ctcaggaagt tgcaggccct 360 
gctggacgtg ctcccagacg ccaggcctgt 420 
catctactgg gacccagctg atgaccttgc 480 
cacccctgat acgccagtga accccttaga 540 
cagtttcctg caacaagaaa tagcgttgaa 600 
cgtgttgcct gactcggaga accactacca 660 
tgtgaggaca tggtggtccc agcccccaag 720 
ggtcttgtcc agggcgcccg ccttcgcctt 780 
gtcccagctt ctcaacgccg tcctcagccc 840 
tcgggacctc aacttgggca ccaatgcccg 900 
ctggtttttt cccagcggaa gggaggactt 960 
gaacctgaga ggtgacatcg ggtctcactg 1020 
ctccgctgtg tttatattga ctgacaatat 1080 
catgaaggag tcaaccacaa aatactactt 1140 
cacaaacctg agatttctga ataagttaat 1200 
cctggtaaag gtcagcagca ctgacagcga 1260 
tgggaatgtg ctgcgggcac cctgcaggcg 1320 
ccgcaaactg ggcctaaagg tcgacgagga 1380 
gatggagagg attaccagga aaatcaaaga 1440 
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ctcggatgcc tacagaaggg acgaqctgag gctgcagggg gacccctgga gaaaggcagc 1500 
ccaagtggag aaggagttct gccagctcca gtgggccgtg gacccccctg agaagcacag 1560 
ggctgagctg aggcggcggc tgctagaact tcgaatgcag cagaacggcc atgattcctc 1620 
ctcgggggtg caggagttca tctcggggat cagcagcccc tccttgagtg agaagcagta 1680 
cttcctgagg tggatggagt ggggcctggc acgggtggcc cagccgcgac tgagacagcc 1740 
tccggagacg cttctcaccc tgagaccaaa gcacgggggc accacagacg tgggggagcc 1800 
gctctggcct gagcccctag gggtggaaca cttcttgcgg gagatgggac agttttatga 1860 
ggctgagagc tgtcttgtgg aggcagggag gctgccggca ggccagaggc gttttgccca 1920 
cttcccaggc ttggcctcgg agctgctgct gacagggctg cctctggagc taatcgatgg 1980 
gagcacgctg agcatgcccg tccgctgggt cacagggctc ctgaaggagc tgcacgtccg 2040 
actggagaga cggtcaaggc tggtggttct gtcaaccgtc ggggtgccag gcacgggcaa 2100 
gtccacactc ctcaacacca tgtttgggct gcggtttgcc acagggaaga gctgcggtcc 2160 
tcgaggggcc ttcatgcagc tcatcacagt ggctgagggc ttcagccagg acctgggctg 2220 
tgaccacatc ctqgtqataq actccggggg cttgataggt ggggccttga cgtcagctgc 2280 
cgacagattt gagctgcacc ctt^ctLugu cactctgctc atgggactga gcaatgtcac 2340 
cgtgatcagt ctagctgaaa ccaaggacat tccagcagct attctgcatg catttctgag 2400 
gttagaaaaa acggggcaca tccccaacta ccagtttgta taccagaacc ttcatgatgt 2460 
atctgttccc ggccctagqc ccacagecaa gagacagctc ctggatccac ctggtgacct 2520 
gagcagggct gcagcccaga tgaagaaaca gggcgacggc ttccgggcac tggcaggcct 2580 
cgccttctgc gaccctgaga accagcacat ctggcacatc ccaggcctgt ggcacggagc 2640 
acctcccatg gccgcagtga ccttggccta cagtgaagcc atatttgaat tgaagagatg 2700 
cctactcgaa aacatcagga acggcttgtc gaaccaaaac aaaaacatcc agcagctcat 2760 
tcacctggtg agacggctgt aactgtgcag agaaacccag ttcaggtgta ggaggctgct 2820 
gtgggcagcc ctgtctgatg ggccacccgt gtggggctgt gctctggtgc ctgagaatgg 2880 
ctggtgccca atcgacatga gaagacgagc agaagacaga gggtttggag tctcctcaac 2940 
agtgttaaaa gaggaagtga cctcacagac cagctcagag atgttaccaa gaatatcaca 3000 
gcccccaggg tagggagaca accaccagtt tgttctgtct cagctcctgt caaggatcct 3060 
gcggggtggg ccctctgtat agctgctctc tgtcactggc ccctggagtg ggagcagcgt 3120 
ccttagtcac tgcaggccca cccccccagg tggtcccagg acagaggtgg ggaagttgtc 3180 
ctgaggaagc agaagtaggc cttgctcccg cccaacccaa gggcctccag tggaccagca 3240 
ttcaagatgt gagtgcccgt ggtgtgcaag gcactcccat ggcaccgtat ttattgactg 3300 
atctgtgaag gcttccctga cccctgccca ggaagagttc actggtcgct ctgttgtgcc 3360 
ccacagcact ttgttatacc tctgccacac acttcacgca gcgcgttgta actcatgtgt 3420 
ttacatgtct gtccccccag actgtgagct ccttgagggc agggactgta cattctccag 3480 
ctctgtgtcc ccagggcctg gcacattgta gacgcttaat aaatgtctgt taaatgaaaa 3540 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3600 
aaaaaag 3607 

<210> 12 
<211> 753 
<212> DNA 

<213> Homo sapiens 
<400> 12 

cttaccgaca gacagacgct gggacccacg acgacagaag gcgccgatgg ccgcctgctg 60 
agccctgccc ggggcagggg gtctggaacc agacagagcc tgaacctgcc gccaccagcc 120 
Luctgagcct gtgcttcctg agaacagcag gggtctgggt accccccatg tacctctggg 180 
tccttggtcc catctacctc ctcttcatcc accaccatgg ccggggctac ctccggatgt 240 
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ccccactctt caaagccaag atggtagctg ccatccctgg gagcctggaa ccaggcaatg 300 

ttcgggggag gcaggggaca ggctggaacc tggtgaagtc ttaaagtaga ctcctcctat 360 

cggggtgtag aagggaatct gttaatcaaa cagagcaata ttagaaaggc tacagaggtc 420 

aacrcagtgg aacatggttc tcccaaacag attttgtaat tccgaaaatc cacgcatgcg 480 

caaacatacg catacactcc catgttcctg gacagtttat agctaccata acctggcatt 54 0 

ttccaaaaca taccatgtag actcttggat acacaaggta attttagggc cacattagga 600 

tgaacctttt. aaaaggttat gcatttaLLt ttatgtttcc ccactagctg tattatagga 660 

caatttttat atgtgatatg tatttacctt agtgtgttaa ataaacactg gcatttcaaa 720 

aaaaaaaaaa aaaaaaaaaa aaagcggccg ctg 753 

<210> 13 
<211> 2251 
<212> DNA 

<213> Homo sapiens 



<400> 13 

gaattctagg gggaacctca gacccccctc 
tcctcaccca ggggccctga gtccccccga 
ctgggagcca tgcatgaatc aaggagtcgc 
tgcgaggacg ggaaggtgac ctgtgaaaag 
attccctcca gagatggtgg gtgctgccca 
gtccgagctg aaggggatgt gttttcacct 
ctggctggaa acgtgtcctg catctctcct 
ccacagacgg attgctgtac ttgtgttcca 
gcagacgggg ctgtgttcag tgggggtggt 
ggggaggtgg agtgctcctt catgccctgc 
cggctgggcc ctgggcagtg ttgcttcacc 
tctcttgacg acaacggggt tgagtttccg 
tgtgagttat gcatctgcca ggcagatggc 
gactcctgcc ctcacccgat ccggatccct 
tgcacctaca caggcagaat cttctataac 
tgtctgagct gcatctgcct gctgggctca 
acctgtacct accctttcca ccctgacggg 
tacgagggaa ggaaggtggc gaatggccag 
cggtgcacgt gccagctggg agaggtgagc 
gccgaccctg ccctgcttcc tggggactgc 
tctggaagaa aagcaggggc tctcccctca 
gggagcctgc atggagacac tgaggcccct 
acagcatcac cctcgaggcc ggtgcttcat 
atgaaaacac agactttacc tacaagcccg 
tt: 999gctga cagccacttt cccaggggag 
ccttcgaccc ctccaggagc ccccactcta 
cctcctgtga ctccagagcg ctcgttctca 
cctcctctgc ctggcaccct cctgacggaa 
ccctcgaaga cccccatcac cctcctcggg 
ctctccacag cccttgcagc caccacccac 
tctcgggggg aagagtccao catgtaagga 
ggacctctgc cagtggccca gggtgtgtgc 



actccttcag ggggaggtga tgggacccct 60 
ctggcagcag ggccctctcc ctgctggcac 120 
tggacagagc ctgggtgttc ccagtgctgg 180 
gtgaggtgtg aagctgcttg ttcccaccca 240 
tcgtgcacag gctgttttca cagtggtgtc 300 
cccaatgaga actgcaccgt ctgtgtctgt 360 
gagtgtcctt ctggcccctg tcagaccccc 420 
gtgagatgct atttccacgg ccggtggtac 480 
gacgagtgta ccacctgtgt ttgccagaat 540 
cctgagctgg cctgcccccg agaagagtgg 600 
tgccaggagc ccacaccctc gacaggctgc 660 
attggacaga tctggtcgcc tggtgacccc 720 
tcggtgagct gcaagaggac agactgtgtg 780 
ggacagtgct gcccagactg ttcagcaggc 840 
aacgagacct tcccgtctgt gctggaccca 900 
gtggcctgtt cccccgtgga ctgccccatc 960 
gagtgctgcc ccgtgtgccg agactgcaac 1020 
gtgttcacct tggatgatga accctgcacc 1080 
tgtgagaagg ttccctgcca gcgggcctgt 1140 
tgctcttcct gtccagattg ccctgtctcc 1200 
cggaaaatgt ggcattcagc aaagctggtc 1260 
gtcaactgta gctcctgtcc tgggcccccg 1320 
ctcctccagc tccttttaag aacgaacttg 1380 
gcaggagctc atggtccaca ctcactcgct 1440 
cctggggcct cccctcgact ctcaccaggg 1500 
cctctagctt ccccaggggc tcctcagcca 1560 
gcctctgggg cccagatagt gtccaggtgg 1620 
gcttcagcac tttccatgat ggaccccagc 1680 
cctcgcgtgc tttctcccac cacctctaga 1740 
cctggccccc agcagccccc agtgggggct 1800 
ggtcactgtg tccgggagac tctggagaga 1860 
agggcacctc caaggatgaa cctggtgggg 1920 
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atgcctgggc tccctcctgc acgggccctg gtgaggatgg aagaccccca aggctggatg 198C 

taaccttgtt cccaagaagt gtttggaatg tgctgtaaga atggaggaag tcgtttccac 2040 

tgtcagcatc ctcccctgga ccgcgtggct ggctcatctt ttgagaaggg ttgggactgc 2100 

caagttttcc tggaggaaga gttgcgtccg gctgggattc cactcactgg gactgtaccg 2160 

ccaggtgtca tgcgtctttc tgaggtttcc tgattaaagg ttgtttcggt ttcctaaaaa 2220 

aaaaaaaaaa aaaaaaaaaa aaaaagcggc c 2251 

<210> 14 
<211> 541 
<212> DNA 

<213> Homo sapiens 
<40C> 14 

gccacaatgg tgcgcatgea tctcctcgca gatcctctca agagtatcaa caatgccgaa 6C 
aagagaggca aaccccaggt gctlattacg ccgtcctcca aagtcatcgr cccgtttctc 12C 
actgtcatga tgaagcatgg ttacattggc gaatttgaaa tcattgatga ccacagagct 18C 
gggaaaattg ttgtgaacct cacaggcagg ctaaacaagt gtggggtgat aagccccaga 24C 
tttgacctgc aactcaaaga cctggaaaaa tggcagaata atctgcttcc atcccgccag 3CC 
tttggtttca ttgtactgac aacctcagct ggcatcatgg accatgaaga agcaagacga 36C 
aaacacacag gagggaaaat cctgggattc tttttctagg gatctaatac atatatttac 420 
aaataaaatg cctcatggac tctggtcctt ccaaaaaaaa aaaaaaaaaa aaaaaaaaaa 480 
aaaaaaaaaa aaaaaaaaaa aaaaattaaa aaaaaaaaaa aaaaaaaaaa aaaaagcggc 54C 
c 541 



<210> 15 
<1'11> 66C 
<Z12> DNA 

<213> Homo sapiens 



<400> 15 

gccgccgycg aggattcagc agcctccccc ttgagccccc tcgcttcccg acgttccgtt 60 
cccccctgcc cgccttctcc cgccaccgcc gccgccgcct tccgcagccg tttccaccga 120 
ggaaaaggaa tegtategta tgtcegctat ccagaacctc cactctttcg acccctttgc 180 
tgatgcaagt aagggtgatg acctgcttcc tgctggcact gaggattata tccatataag 240 
aattcaacag agaaaeggea ggaagaccct tactactgtc caagggatcg ctcatgatta 300 
cgataaaaag aaactagtga aggcgtttaa gaaaaagttt gectgeaatg gtactgtaat 360 
tgagcatccg gaatatggag aagtaattca gctacagggt gaccaacgca agaacatatg 420 
ccagttcctc gtagagattg gactggctaa ggacgatcag ctgaaggttc atgggtttta 480 
agtgcttgtg gctcactgaa gcttaagtga ggatttcctt gcaatgagta gaatttccct 540 
tctctccctt gtcacaggtt Laaaaacctc acagcttgta taatgtaacc atttggggtc 600 
cgcttttaac ttggactagt gtaactcctt catgeaataa actgaaaaga gecatgeaaa 660 

<210> 16 

<211> 18 

<212> PRT ' 

<213> Home sapiens 



<400> 16 
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Asp Asp Tyr Asp Lys Lys Lys Leu Val Lys Ala Fhe Lys Lys Lys Phe 
1 5 10 ' " 15 

Ala Cys 



<210> 17 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 17 

Giu His Pro Glu Tyr Giy Ciu Val lie Gin Leu Gin Gly As P Gla Arg 

1 5 10 lb 

Lys Asn lie Cys 

20 



<210> 16 
<211> 25 
<212> PRT 

<213> Homo sapiens 



<40C> 16 

Cys Gin Lys Ala Lys Asp Arg Met Glu Arg He Thr Arg Lys He Lys 
1 5 10 15 

Asp Ser Asp Ala Tyr Arg Arg Asp Glu 
20 * 25 



<210> 19 
<211> 19 
<212> PRT 

<213> Homo sapiens 



<4C0> 19 

Pro Arg Pro Arg Asp Lys Arg Gin Leu Leu Asp Pro Pro Giy Asp Leu 
1 5 io is 



Ser Arg Cys 



<210> 20 
<211> 15 
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<212> PRT 

<213> Hono sapiens 
<400> 20 

Cys Gly Val Trp Asn Gin Thr Glu Pro Glu Pro Ala Ala Thr Ser 
1 5 io 15 



<210> 21 
<211> 16 
<212> PRT 

<213> Homo sapiens 



<400> 21 

His His His 



i.eu Arg Met Ser Pro Leu Phe lys Cys 
10 i c 



<210> 22 
<21i> 16 
<212> FRT 

<213> Homo sapiens 



<400> 22 

Pro Cys Pro Glu Leu Ala Cys Pro Arg Glu Glu Trp Arg Leu Gly Pre 

1 ^ 10 n : 



<210> 23 
<211> 24 
<212> PRT 

<213> Homo sapiens 



<400> 23 

Asp Pro Ser Arg Ser Pro His Ser Thr Ser Ser Phe Pro Arg Gly Ser 
1 5 io lb 

Ser Ala Thr Ser Cys Asp Ser Arg 
20 



<210> 24 
<211> 21 
<212> PRT 

<213> Homo sapiens 

<400> 24 

His Pro Pro Asp Gly Ser Phe Ser Thr Phe His Asp Gly Pro Gin Pro 
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1 



5 



10 



15 



Leu Glu Asp Pro Cys 
20 



<210> 25 
<211> 13 
<212> PRT 

<213> Homo sapiens 
<400> 25 

Lys Scr He Asn Asn Ala Glu Lys Arg Gly Lys Arg Cys 
15 in 

<21C> 26 
<211> 14 
<212> PRT 

<213> Homo sapiens 
<40C> 26 

Asp His Glu Giu Arc Arq Arg Lys His Thr Gly Gly Lys Cys 



i 
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